Evaluation of Spectral Tilt Measures for Sentence Prominence Under Different Noise Conditions
نویسندگان
چکیده
Spectral tilt has been suggested to be a correlate of prominence in speech, although several studies have not replicated this empirically. This may be partially due to the lack of a standard method for tilt estimation from speech, rendering interpretations and comparisons between studies difficult. In addition, little is known about the performance of tilt estimators for prominence detection in the presence of noise. In this work, we investigate and compare several standard tilt measures on quantifying prominence in spoken Dutch and under different levels of additive noise. We also compare these measures with other acoustic correlates of prominence, namely, energy, F0, and duration. Our results provide further empirical support for the finding that tilt is a systematic correlate of prominence, at least in Dutch, even though energy, F0, and duration appear still to be more robust features for the task. In addition, our results show that there are notable differences between different tilt estimators in their ability to discriminate prominent words from nonprominent ones in different levels of noise.
منابع مشابه
Great Expectations – Introspective vs. Percept their Acoustic Correl
In order to gain knowledge about the interaction between topdown expectations of listeners concerning prosodic prominence and its acoustic correlates, two exploratory empirical studies were carried out. First, native and nonnative subjects rated prominences of speech read at normal and very fast —prosodically very different — speech. Later, these ratings were compared with introspective promine...
متن کاملAcoustic Correlates of Lexical Stress in Persian
This paper examines the effects of lexical stress on intensity and duration in Persian both in the presence of the intonational prominence contrast and in the abstraction from the compounding accent condition. A production study was conducted in which 10 speakers produced Persian lexical and reiterant disyllabic minimal stress pairs spoken with and without an accent in a fixed carrier sentence....
متن کاملAcoustic Correlates of Glottal Gaps
During speech production, the vocal folds may not close completely. The resulting glottal gap (GG) or incomplete glottal closure has not been systematically studied in terms of GG acoustic and/or perceptual consequences. This paper uses high-speed imaging to investigate the relationship between GG area, source parameters, acoustic measures, and voice quality for 6 subjects. Results showed that ...
متن کاملSpectral tilt modelling with GMMs for intelligibility enhancement of narrowband telephone speech
In mobile communications, post-processing methods are used to improve the intelligibility of speech in adverse background noise conditions. In this study, post-processing based on modelling the Lombard effect is investigated. The study focuses on comparing different spectral envelope estimation methods together with Gaussian mixture modelling in order to change the spectral tilt of speech in a ...
متن کاملSNR loss: A new objective measure for predicting the intelligibility of noise-suppressed speech
Most of the existing intelligibility measures do not account for the distortions present in processed speech, such as those introduced by speech-enhancement algorithms. In the present study, we propose three new objective measures that can be used for prediction of intelligibility of processed (e.g., via an enhancement algorithm) speech in noisy conditions. All three measures use a critical-ban...
متن کامل